Terminology as Knowledge in Answer Extraction
نویسندگان
چکیده
It is well known that one of the greatest hurdles in automatically processing technical documentation is the large amount of specific terminology that characterizes these domains. Terminology poses two major challenges to the developers of NLP applications: how to identify domain specific terms in the documents and how to efficiently process them. In this paper we will present methodologies that we have used to extract and bootstrap a terminological database and its usage in an answer extraction system. Posted at the Zurich Open Repository and Archive, University of Zurich ZORA URL: https://doi.org/10.5167/uzh-19092 Originally published at: Rinaldi, F; Dowdall, J; Hess, M; Kaljurand, K; Koitand, M; Kahusk, N; Vider, K (2002). Terminology as knowledge in answer extraction. In: TKE-2002: 6th International Conference on Terminology and Knowledge Engineering, Nancy, France, August 2002 August 2002. Terminology as Knowledge in Answer Extraction Fabio Rinaldi , James Dowdall , Michael Hess , Kaarel Kaljurand, Mare Koit, Kadri Vider, Neeme Kahusk Institute of Computational Linguistics, University of Zürich Winterthurerstrasse 190, CH-8057 Zürich, Switzerland fdowdall,hess,[email protected] Research Group of Computational Linguistics, University of Tartu J. Liivi 2, 50409 Tartu, Estonia fnkahusk, kaarel, koit, [email protected]
منابع مشابه
A Comparative Study of the Effect of Word Segmentation On Chinese Terminology Extraction
Automatic term extraction is the first step towards automatic or semi-automatic update of existing domain knowledge base. Most of the researches applied word segmentation as a preprocessing step to Chinese term extraction. However, segmentation ambiguity is unavoidable, especially in identifying unknown words for Chinese. In this paper, we discuss the effect and limitations of segmentation to C...
متن کاملMeasuring the Total Knowledge of English Sports Terminology by the Physical Education Students at Saudi Universities
Background. Given the importance of English a global language and one of the four official languages used in FIFA, it is essential to acquaint students who are specialised in Physical Education with English sport terminology. Objectives. This study aims to determine the knowledge level of physical education students in Saudi universities’ faculties and departments in relation to English sports...
متن کاملNLP for Answer Extraction in Technical Domains
In this paper we argue that questionanswering (QA) over technical domains is distinctly different from TREC-based QA or Web-based QA and it cannot benefit from data-intensive approaches. Technical questions arise in situations where concrete problems require specific answers and explanations. Finding a justification of the answer in the context of the document is essential if we have to solve a...
متن کاملJATE 2.0: Java Automatic Term Extraction with Apache Solr
Automatic Term Extraction (ATE) or Recognition (ATR) is a fundamental processing step preceding many complex knowledge engineering tasks. However, few methods have been implemented as public tools and in particular, available as open-source freeware. Further, little effort is made to develop an adaptable and scalable framework that enables customization, development, and comparison of algorithm...
متن کاملTechnical Terminology as a Critical Resource
Technical documentation is riddled with domain specific terminology which needs to be detected and properly organized in order to be meaningfully used. In this paper we describe how we coped with the problem of terminology detection for a specific type of document and how the extracted terminology was used within the context of our Answer Extraction System.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002